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Abstract 

In previous work, we have defined the telescopic relative entropy (TRE), which 
is a regularisation of the quantum relative entropy ^(pllfT) = Trp(logp — log cr), 
by replacing the second argument o" by a convex combination of the first and the 
second argument, r = a/? + (1 — a)a and dividing the result by — logo. We also 
explored some basic properties of the TRE. In this follow-up paper we state and 
prove two upper bounds on the variation of the TRE when either the first or the 
second argument changes. These bounds are close in spirit to a triangle inequality. 
For the ordinary relative entropy no such bounds are possible due to the fact that 
the variation could be infinite. 



1 Introduction 



The quantum relative entropy between two quantum states p and a, S{p\\a) = 
Tr p(Iogp — log 0"), is a non-commutative generalisation of the KuUback-Leibler 
distance between probability distributions and is widely used as a distance 
measure between quantum states [3]. One of its main drawbacks, however, is 
that the relative entropy is infinite when {p} ^ {a} (with {p} denoting the 
projector on the support of p). In particular, relative entropy is useless as a 
distance measure between pure states, since it is infinite for pure p and cr, 
unless p and a are exactly equal (in which case it always gives 0). 

To overcome this problem, in [T] we introduced a regularisation of the relative 
entropy, which we call the telescopic relative entropy (TRE): 
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Definition 1 For fixed a G (0,1), the a-telescopic relative entropy between 
states p and a is given by 

SaiplW) := — ^ S{p\\ap+{l-a)a). (1) 
- log (a) 

We showed that the value of the TRE is always between and 1; Sa{p\\cr) = 1 
if and only if p ± a. 

Furthermore, we have defined the limits a — )■ and a — 1 and have shown 
that these limits exist and can be expressed in closed form: 

Theorem 1 For any pair of states p, a, 

So{p\\a):=\imSM^) = 1 -TrpM (2) 
{p\\a) := hm 5,(p| |a) = 1 - Tr a{p}. (3) 

a— >1 

The origin of the name 'telescopic' is that the operation o" ap + (1 — a)a 
acts like a 'telescope' with 'magnification factor' 1/(1 — a), bringing the state 
a closer to the 'vantage point' p and bringing observed pairs of states Uj closer 
to each other. 

The main result of the present paper (see Section [3]) is the establishing of two 
upper bounds on the variation of the TRE, when either one of the arguments 
varies. In some sense these bounds could be considered as close relatives of 
the triangle inequality, while in another sense they could be considered as 
Fannes-type continuity inequalities. The two inequalities have no counterpart 
for the ordinary relative entropy, because the constants appearing in them 
would have to be infinite. The existence of reasonable bounds for the TRE is 
due to the telescoping process. 

In the next section we first collect the prerequisites, defining the basic no- 
tations and stating known relations for the relative entropy and the right- 
derivative of the operator logarithm. 



2 Preliminaries 

2. 1 Notations 

For any self-adjoint operator X on a Hilbert space "H, we denote by suppX 
the support of X, i.e. the subspace of % which is the orthogonal complement 
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of kerX, the kernel of X. The projector on the support of X will be denoted 

by {X}. 

For any self-adjoint operator X, X+ will denote the positive part X+ — [X -\- 
\X\)/2. It features in an expression for the trace norm distance between states: 

r(p,(j):=^||p-(7||i=IY(p-a)+. (4) 



The trace of the positive part has a variational characterisation as TrX+ = 
maxp Tr XP, where the maximisation is over all self-adjoint projectors. Hence, 
for all such projectors P, TyXP < TrX+. 

Two quantum states are mutually orthogonal, denoted p _L cr, iff Tr pa — 0. 



2.2 Gradients 



We will need the gradients of the relative entropy. Here we bring together all 
known facts. 

The following integral representation of the logarithm lies at the basis of much 
of the subsequent treatment. For x > 0, we have 

°° 1 1 



This immediately provides an integral representation for the ordinary relative 
entropy: 



S{p\\a) = -jds Tr + s)-' - {a + s)-'] (6) 



oo 

^- j dsTYp{p + s)-^ {a- p) {a + s)-\ (7) 



Likewise, we get similar expressions for the telescopic relative entropy: 



Sa{p\\o) 

1 °° 

I ds p[{p + s)-^ - {ap + {1 - a)a + s)-^] (8) 

)ga J 



log( 
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log 



1 °° 

— [ dsTip(p + syUl~a)(a- p) (ap + (l-a)a + sy\ (9) 
ea J 



Another integral we will encounter is ds x/(x + s)^. For a: = 0, the integral 
obviously gives 0. For x > it gives 1. Hence 



I ds ip + s)-'pip + s)-' = {p}. 



(10) 



The gradient of the relative entropy w.r.t. its first argument is defined through 
the relation 



TrA Vi^(A||5) = | 



S{A + tA\\B) 



t=Q 



:TrA(log(A) -log(S)) + TrA— 



log(A+^A). 



t=o 



The gradient of the relative entropy w.r.t. its second argument is defined 
similarly through 



TrA V2^(A||5) = ^ 



SiA\\B + tA) 



t=o 



-TtA 



d_ 



log(5 + tA). 



Hence, to find explicit expressions, the derivative of the logarithm is needed. 
From integral representation ([5]) we get 



d 
It 



\og{A + tA) = j ds {A + sl)-^A{A + sT)-\ 



It will be useful to introduce the following linear map, for A >0: 



rA(A) = Jds {A + sI)-^A{A + sI)-\ 



Thus 



d 
di 



log(A + tA) = r^(A). 



(12) 



t=o 



It's easy to check that for A > 0, Ta{A) = {A}. Thus, for A > 0, we have 
TAiA) = I. 

From the integral representation it also follows that, for any self-adjoint A, 
Ta preserves the positive semidefinite order: ii X <Y , then Ta{,X) < Ta{X)- 
By cyclicity of the trace, we see that the map 7a is self-adjoint: Tri?7A(A) = 
Ti^Ta{B). Moreover, the map is positive semi-definite, in the sense that 
Tr A7a(A) is positive for any self-adjoint A. This follows from the integral rep- 
resentation and the fact that for positive X and self-adjoint Y , Ti XYXY = 
Tr(XV2rxi/2)2 > 0. 

Further properties are discussed in [2]. In particular, the map {A,X) i— )■ 
Tt X*Ta{X) , for y4 > and any X, is jointly convex in A and X ([2], Theorem 
3). 

Using these properties of Ta one easily obtains (see also [3j, Chapter 3): 

Lemma 1 Let A and B he positive operators. The gradient of S{A\\B) = 
Tt A{log A — log B) w.r.t. A is given by 

V iS {A\\B) = log A- log B + {A}. (13) 



The gradient w.r.t. B is given by 

V2S{A\\B) = -rBiA). (14) 

The corresponding statement for the telescopic relative entropy is 

Lemma 2 Let A and B be positive semidefinite operators. The gradient of 
Sa{A\\B) w.r.t. A is given by 

V,SaiA\\B) = _^(log A - logC + {A} - aTciA)), (15) 
— log a 

with C = a A + [1 — a)B. The gradient w.r.t. B is given by 

V2Sa{A\\B) = -l-l^TBiA). (16) 
— log a 



Having defined the linear operator T via the first derivative of the logarithm, 
we can also define a quadratic operator 71 via the second derivative For 
A> and A self-adjoint, 

log(A + tA). (17) 
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A simple calculation using the integral representation of the first derivative 
yields the integral representation 



7^A(A) = 2 j ds{A + sl)-^/\{A + siy^A^A + sl)-\ 



2. 3 Basic properties of Telescopic Relative Entropy 



We have shown in [T] that the value of the telescopic relative entropy is always 
between and 1, even for non-faithful states. Furthermore, it inherits many 
desirable properties from the ordinary relative entropy, like positivity, the fact 
that it is only zero when p and a are equal (provided a > 0), joint convexity 
in its arguments, and monotonicity under CPT maps. 

The following identities are straightforward: 



hSa{X\\Y)=Sa{hX\\hY) (19) 
Sa{hX\\cX) = Sa{h\\c). (20) 

As we do not restrict the arguments of the telescopic relative entropy to states, 
the definition of TRE is also applicable to non-negative scalars: 

SMc) = ^-^^^^^^^^^^^^^^^^^. (21) 

— log a 



In particular, we have 

5„(6||0)=fe, 5,(0||c) = 0. (22) 



Both the ordinary relative entropy and the TRE satisfy certain monotonicity 
properties when applied to non-normalised positive operators: 

Lemma 3 For A,B,X > 0, 



S{A + X\\B + X)<SiA\\B) (23) 

SaiA + X\\B + X)<SaiA\\B) (24) 

S{A\\B + X)<S{A\\B) (25) 

SaiA\\B + X)<SaiA\\B). (26) 

Proof. The former two inequalities follow from joint convexity of S and Sa'- 
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Sa{A + X\\B + X)= 2Sa{{A + X)/2\\{B + X)/2) 
<Sa{A\\B) + Sa{X\\X) 
^Sa{A\\B). 



The latter two inequalities follow from operator monotonicity of the loga- 
rithm. □ 



3 Main results 

In this section, we present a number of highly non-trivial inequalities con- 
cerning the telescopic relative entropy. We like to point out here that these 
inequalities are just as much about the ordinary relative entropy, applied in 
the setting where cr is a convex combination of p and another state, and are 
therefore of interest regardless whether one wishes to use the telescopic relative 
entropy as an independent concept or not. 

While the telescopic relative entropy shares many properties with the ordinary 
relative entropy, and improves on certain undesired properties, just like the 
relative entropy it does not satisfy a triangle inequality in the strictest sense: 
SaipW^) ^ Sa{p\\<7) + Sa{<j\\T). However, due to the telescoping, inequalities 
can be proven that at least come close in spirit to a triangle inequality. 

Here we prove bounds on the difference between two telescopic relative en- 
tropies, the first between the distances from p to Ti and to T2, respectively, 
in terms of the trace norm distance between Ti and T2; the second between 
the distances from pi to r and p2 to r, in terms of the trace norm distance 
between pi and p2- These two bounds prove the continuity of the telescopic 
relative entropy in both of its arguments, in the sense of Fannes. 

First we state a triangle inequality w.r.t. the first argument: 

Theorem 2 For a e (0, 1), and for states pi, p2, o such that T{pi, P2) — t, 

l^a(Pllk) - Sa{p2\\a)\ < 1 - Sail - tm - Sam) 

^t-Sam)- (27) 

It is easily verified that equality is achieved for pi ^- a and p2 — ta + {1 — i) pi. 
For the ordinary relative entropy no such bound is possible, as can be seen 
by taking two different pure states for p and cxi, and a mixed state for (T2: for 
such a choice the difference |5'(p||(7i) — S'(p||(72)| becomes infinite. 

The proof of this theorem relies on the following proposition, which may be 
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of independent interest: 

Proposition 1 For A, B, X positive operators, with b = Ti B and x = Tt X , 

S{A\\A + X) > S{A + B\\A + B + X) > S{A\\A + X) + S{b\\b + x) (28) 

Next, we state a triangle inequality w.r.t. the second argument: 
Theorem 3 For a G (0, 1), and for states p, cxi, (T2 such that T{ai, (12) = t, 

\Sa{p\Wl) - SMCX2)\<1 - Sa{l\\t) 

1 — a 

< 1. 

—a log a 

When p ± o"i and a2 = tp + {1 — t)o'i, equality is achieved. This shows that 
the inequality is sharp, for any a and t. 

Again, for the ordinary relative entropy no such bound is possible, as can be 
seen by taking two different pure states for p and cxi, and a mixed state for 
(72: for such a choice the difference [^(pllai) — S'(p||(j2)| becomes infinite. 

Note that the coefficient (1 — a)/ (—a log a) is always greater than or equal to 
1. It tends to +00 in the limit a — )■ and to 1 in the limit a — )■ 1. This implies 
in particular that |S'i(p||(Ti) — S'i(p||cr2)| < T(a"i,cr2). That can also be seen to 
follow from Theorem [H as 

\Si{p\Wi) - Si{p\\a2)\ = I Tr{p}(ai - (T2)| 

< Tr((Ti - (T2) + 
= T((Ti,0-2). 

The proof of Theorem [3] relies on the following proposition, which is a coun- 
terpart of Proposition [1) 

Proposition 2 For A,B,X > 0, with b = Ti B and x = Ti X, 

S{X\\A + X) > S{X\\A + B + X) > S{X\\A + X) + S{x\\b + x). (31) 

Equivalently, for every state p and all A,B>{], with b = Ti B, 

< Tr p(log(p + A + B)- log(p + A)) < log(l + b). (32) 

Finally, we consider the remaining case, and 5*1: 
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(29) 
(30) 



Theorem 4 The telescopic relative entropy So{p\\(j) is continuous in p but 
discontinuous in a. For Si{p\\cr) the opposite situation holds. 

Proof. This can be seen immediately from the closed form expressions of 
Theorem [T] Linearity in one argument obviously implies continuity in that 
argument. On the other hand, the function that maps a state to the projector 
on its support is discontinuous, and this shows discontinuity in the other 
argument. □ 



4 Proofs 



In this section we will prove the stated triangle inequalities. The centerpiece 
in these proofs is the following proposition: 

Proposition 3 For A,B,X > 0, with b = TiB, x = TiX, 

bx 

< TiXTA+xiX) - Tr XTa+b+x W < -—. (33) 

b + X 



To prove this we first need some lemmas. 
Lemma 4 For A, B > 0, 

Tt{A + B)nA+B{A) > Ti ATaMA). (34) 

Proof. It has been proven in [2] that for A, B > and K, M self-adjoint, 

-Tt BTlAiK) + 2Tt MTAiK) <1:iMTb{M). 

The inequality of the lemma follows by replacing A and B hy A + B, and K 
and M hy A. □ 

Lemma 5 For A, i? > 0, 

nA+B{A) < I. (35) 



Proof. We use the integral representations of T and TZ. Since A + B + sl > B, 
we have {A + B + sl)'^ < B'^ and B{A + B + sI)-'^B < B. Therefore, 



TlA+BiB) = 2 J ds {A + B + sl)-^ B{A + B + siy^B {A + B + sl)"^ 
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oo 



<2 ds {A + B + sl)-^ B (A + B + sI) 







= 2rAMB). 



It has been proven in [2] that, for A> and A self-adjoint, 



I + 2rA{A) + 7^A(A) = 7^A(A + a) > o. 



Therefore, 



TZaMA + B-B)=I + 2Ta+b{-B) + TlA+Bi-B) 

= l~2TA+B{B)+nA+B{B). 



Thus, indeed, 7^A+B(A) < I. □ 

Lemma 6 Let f{t) he a real-valued convex function on [0,1]. If, moreover, 
/(O) < and /(O) < /'(O), then Vt G [0, 1], /(O) < (1 - t)/(t). 

Proof. Since /(O) < 0, for all t E [0, 1] we have /(0)/(l - 1) < /(O) < /'(O). 
Muhiplying both sides by t(l-t) yields t/(0) < t(l-t)/'(0). Adding (l-t)/(0) 
to both sides gives /(O) < t(l - t)/'(0) + (1 - t)/(0) = (1 - t)(/(0) + 1/'(0)). 
By convexity of /, /(O) + t/'(0) is a lower bound on /(t), and the inequality 
of the lemma follows. □ 

Proof of Proposition [Si The first inequality in fl33|) easily follows from the 
fact that X H-> 1/x is operator monotone decreasing together with the identity 



and monotonicity of Tr . The second inequality involves more work. 
Let us introduce an operator G such that G > p, and consider the function 

fit) = Tt pTta+{l-t)G{p) - 1- 

We will first show that (1 - > /(O) for < t < 1. 

The derivative /'(O) can be calculated explicitly from the integral representa- 
tion of T: 



oo 




/\A + XI)-'X'/y, 







/'(O) 



d 
dt 



t=o 



Tl prta+il-t)G{p) 




dx Tr p{G + t{a - G) + xI)-^p{G + t{a - G) + xl) 



-1 
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= - J dx Tr[p(G + xl)-\a - G){G + xI)-^p{G + xl)'^ 



+p{G + xI)-^p{G + xl)-\a -G){G + xiy^] 
= Tr(G-a)7^G(p). 

By combining the inequalities of Lemma's [Hand [5] we obtain 

Tr(G'-a)7^G(p)>TrprG(p)-l, 

which proves that /'(O) > /(O). In t = 0, / takes the value Tr pTaip) — 1, 
which is non-positive, since Tg{p) < Tg{G) = I. Thus /(O) < 0. Moreover, 
by convexity of the map (p, G) ^ Ti pTcip), fit) is convex. By Lemma [6] 
these three statements imply that (1 — t)f{t) > /(O), for < t < 1, i.e. the 
minimum of (1 — t)f{t) over [0, 1] occurs in t = 0. 

Now let t = h/(h + x), which indeed takes values in the interval [0, 1], and 
is iff 6 = 0. Then l-t = x/{h + x). Also, let G = p + A/x. With these 
substitutions, we get 



X{l-t)f{t) = TTXpTba+xp+A{xp) 



b + X 

and we therefore find that this expression is minimal for 6 = 0. That is, 

Trxp7^<^+^p+^(a;p) - ^-p-^ > Tr xp7;p+A(a;p) - x, 
or, after rearranging terms, 

bx 



TrXpTA+xpixp) - TlXpTA+ba+xpixp) < 



b + X 



By substituting X = xp and B = ba we obtain the second inequality of the 
proposition. □ 

This proposition now allows us to prove Propositions [Hand [21 and subsequently 
Theorems [2] and [3l 

Proof of Proposition [H We want to show 

S{A\\A + X) > S{A + B\\A + B + X)> S{A\\A + X) + S{b\\b + x). 
The first inequality is Lemma [31 

The second inequahty is proven using the second inequality in ( 133|) : 

bx 

TiXTA+xiX) - TiXTA+B+xiX) < -—. 

b + X 
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On replacing X by tX and dividing both sides by t, we get 

i:iXTA+B+tx{tX) - Ti XTA+tx{tX) > (36) 

b + tx 



When A > A + B + tX and A + tX are positive for all t, so that 
TA+B+tx{A + B + tX) = TA+txiA + tX) = I. 

Therefore, 

Tr XTA+B+tx{tX) - TiXTA+txitX) 

= - TiXTA+B+txiA + B) + TiXTA+txiA) 

= - Ti{A + B)rA+B+tx{X) + Tr ATA+txiX) 

= ^ (- Tr(A + B) log(A + B + tX) + TtA \og{A + tX)) . 
Thus the inequality fl5B]) becomes 

u h'T 

— (- Tr(A + B) log{A + B + tX) + Tr A log(A + tX)) > 



dt b + tx' 

for all t > 0. Integrating over t G [0, 1] this turns into 



(- Tt{A + B) \og{A + 5 + X) + Tr(A + B) \og{A + B)) 
+ (Tr ^ log(A + X) - Tr A log A) 

— bx 

j—j^ = K^ogb-logib + x)), 
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which is the inequality of the proposition. 

By a standard continuity argument, this is also true for A> 0. □ 
Proof of Theorem [2]. 

W.l.o.g. we assume that Sa{pi\\(j) is not less than Sa{p2\W), so that the abso- 
lute value signs can be removed. Changing signs, we will prove 

Sa{p2\W)-Sa{pi\W)>Sam)-t. 

Introduce A = p2 — Pi, then t = T(pi, P2) = Tr A+. Noting that p2 = pi + A = 
pi + A+ — A_, Proposition [1] yields 



'S'(P2||P2 + - — ^o") - S{pi\\pi + - — -a) 
a a 
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> S{pi + A+IIpi + A+ + - — -a) - S{pi\\pi + - — -a) 

a a 

>Sit\\t+{l-a)/a). 

Here we successively used the first inequality of Proposition [1] with S = A_, 
and its second one with B = A_|_. After some elementary algebra concerning 
the constant a, we get the required inequality. □ 

Proof of Proposition [21 We proceed in similar fashion as in the proof of 
Proposition [H Using the identity Tx{X) = I for X > 0, the inequality of 
Proposition [3] is equivalent with 

llT 

TiXiTA+B+xiA + B)- TaMA)) < 



b + X 

Replacing A by tA and B by tB and dividing both sides by t gives 

+B)+X {A + B)-rtA+x{A))<-— 
Integrating w.r.t. t over [0, 1] then yields the inequality of Proposition [2j □ 
Proof of Theorem [3l With s = (1 — a) /a, and Xj = ap + (1 — a)ai, 



S{p\\ti) - S'(p||r2) = Trp(log(ap+ (1 - 0)0-2) - log(ap + (1 - a)ai)) 
= Trp(log(p + 50-2) - log(p + sai)) 
= Trp(log(p + sai + 5(0-2 - cri)) - log(p + sai)) 
<Trp(log(p + so-i + s{a2 - cri)+) - log(p + sori)). 

In the last line we used the operator monotonicity of the logarithm and the 
fact that X+ > X. With the identification A = sai and B = 5(0-2 — o"i)+, 
inequality (l32l) yields 



SaiplWi) - Sa{p\W2) < Tr p(log(p + A + B)~ log(p + A))/(- log a) 
<log(l + sTr((T2 - cri)+)/(-loga) 
= (log(a + (1 - a)T(ai, ^2)) - log(a))/(- loga). 

□ 
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